Determining the Origin of Downloaded Files Using Metadata Associations

نویسندگان

  • Sriram Raghavan
  • S. V. Raghavan
چکیده

Determining the “origin of a file” in a file system is often required during digital investigations. While the problem of “origin of a file” appears intractable in isolation, it often becomes simpler if one considers the environmental context, viz., the presence of browser history, cache logs, cookies and so on. Metadata can help bridge this contextual gap. Majority of the current tools, with their search-and-query interface, while enabling extraction of metadata stops short of leading the investigator to the “associations” that metadata potentially point to, thereby enabling an approach to solving the “origin of a file” problem. In this paper, we develop a method to identify the origin of files downloaded from the Internet using metadata based associations. Metadata based associations are derived though metadata value matches on the digital artifacts and the artifacts thus associated, are grouped together automatically. These associations can reveal certain higher-order relationships across different sources such as file systems and log files. We define four relationships between files on file systems and log records in log files which we use to determine the origin of a particular file. The files in question are tracked from the user file system under examination to the different browser logs generated during a user’s online activity to their points of origin in the Internet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data and Methods for the Production of National Population Estimates: An Overview and Analysis of Available Metadata

Thomas Spoorenberg Translated by: Elham Fathi Statistical Center of Iran Abstract. Official population estimates can be produced using a variety of data sources and methods. These range from the direct extraction of information from continuously updated population registers to procedures for updating the status of a population enumerated previously in a periodic census. Additional sources and ...

متن کامل

Just a Click Away: Social Search and Metadata in Predicting File Discovery

Social search has been claimed to improve content discovery by allowing users to draw on their social network to find relevant content. Thus social network information, complemented with metadata, can enhance the search for new information. We examine the relative contribution of social network information and file metadata in predicting downloads of files by analyzing the file browsing behavio...

متن کامل

مقایسۀ مدخل‌های استانداردهای فراداده‌ای در پایگاه‌های نسخه‌های خطی فارسی با مدخل‌های استانداردهای فراداده‌ای در پایگاه‌های خارج از ایران در پوشش مدخل‌های نسخه‌های خطی

Purpose: The present research aims at studying the use of metadata standards in Persian manuscripts databases, and the types and frequencies of these standards in the Optical Character Recognition (OCR) procedure of these databases. Methodology: Research population consists of four Persian databases and 12 Latin databases. The research data is gathered through a checklist, using descriptive su...

متن کامل

Spyglass: Fast, Scalable Metadata Search for Large-Scale Storage Systems

The scale of today’s storage systems has made it increasingly difficult to find and manage files. To address this, we have developed Spyglass, a file metadata search system that is specially designed for large-scale storage systems. Using an optimized design, guided by an analysis of real-world metadata traces and a user study, Spyglass allows fast, complex searches over file metadata to help u...

متن کامل

Reconstructing Tabbed Browser Sessions Using Metadata Associations for Multi-Threaded Browser Implementation

Today, Internet browsers support multiple browser tabs, each browser tab capable of initiating & maintaining a separate web session, accessing multiple URIs simultaneously. As a consequence, the network traffic generated as part of a web request becomes indistinguishable across tabbed sessions. But one can find the “specificity of attribution” in the session-related context information recorded...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JCM

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2013